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ISOLATED NUCLEIC ACIDS ENCODING A SECRETORY SIGNAL FOR 
EXPRESSION AND SECRETION OF HETEROLOGOUS RECOMBINANT PROTEINS 



FIELD OF THE INVENTION 

The present invention relates to the use of nucleic 
acids (termed "SS" for "secretory signal" or "Vtgss" for 
"vitellogenin secretory signal") that function as universal 
secretory signals for rapidly directing recombinant proteins 
out of a heterologous host cell. The invention is also 
applicable to improved and more convenient means of 
detecting reporter proteins by simply sampling culture 
medium in which cells harboring constructs of the invention 
are maintained. The invention is also applicable to 
biosensor technology, to for screening of genetically- 
engineered clones harboring selectable marker (s) to be 
expressed as an indicator, and to achieving systemic 
circulation/distribution of recombinant proteins in a 
transgenic or chimeric host organism. 



DESCRIPTION OF THE RELATED ART 

Although protein expression in bacteria is commonly 
used to obtain recombinant proteins, bacterial hosts present 
several drawbacks to expression of recombinant proteins. 
First, eukaryotic proteins expressed in E. coli are often 
not properly folded and thus non-functional (Cleland, J. L. 
1993, In Protein Folding In Vivo and In Vitro: pp. 1-21) . 
This is a large problem, since most proteins used for 
pharmaceutical/diagnostic purposes are eukaryotic in nature. 
Second, proteins expressed in large amounts in E. coli often 
precipitate into insoluble aggregates called "inclusion 
bodies", from which they can only be recovered in an active 
form by solubilization in denaturing agents followed by 



Attorney 



Docke 



1781-178P 




-2- 



careful renaturation (Chang and Swartz, 1993, in "Protein 
Folding In Vivo and In Vitro", Cleland, J. L . ed., pp. 178- 
188) . Third, it is difficult to engineer the secretion of 
large amounts of expressed protein in E. coli, although it 
has often been possible to secrete small amounts into the 
periplasmic space and to recover them by osmotic shock. 
Finally, the recovered recombinant protein is usually 
contaminated with an unacceptable level of endotoxin/pyrogen 
that makes proteins for pharmaceutical /diagnostic use 
extremely difficult to validate (Walsh and Headon, 1994, in 
"Protein Biotechnology", pp. 118-162) . 

Although, protein expression in yeast ( Saccharomyces 
cerevisiae) can solve some of the problems associated with 
bacterial expression, the harsh conditions necessary to lyse 
the yeast often result in poor yield and denaturation of the 
recombinant proteins. Secretion of recombinant proteins from 
yeast has largely been a trial and error effort. 
Furthermore, most secreted proteins are trapped in the yeast 
periplasmic space and thus, difficult to retrieve (Lemontt 
et al., DNA -4:419-428 (1985); Martegani et al., Appl . 
Microbiol. Biotechnol. 37:604-608 (1992)). 

The mammalian and insect expression systems have a 
number of advantages that have contributed to the recent 
popularity (Straten et al., 1989, in "Invertebrate Cell 
Systems Applications", pp. 183-193; Yarranton, G. T., Curr. 
Opinion Biotech. 1:133-140 (1990)). Recombinant proteins are 
almost always expressed at high levels and the expressed 
proteins are usually properly modified and accumulated in 
the proper cellular compartment. However, there have been 
occasions where the use of the homologous secretory signal 
of the gene insert resulted in mistargetting of the 
expressed proteins, which consequently remained within the 
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yeast. Generally, protein expression in higher eukaryotes 
such as insect cells and mammalian cells is challenging, 
time-consuming and expensive although they are suitable 
hosts for small- and medium-scale work. 
5 An example of a Drosophila expression system utilizes a 

cell line derived from Drosophila melanogaster , Schneider 2 
(S2) cells, and a simple plasmid vector for the expression 
of heterologous proteins (Ivey-Hoyle, M . , Curr. Biol. 2:104- 
707 (1991)). S2 cells and mammalian cells such as CHO cells 

10 have been used to express proteins for both biochemical 
assays and therapeutics. In both the insect and mammalian 
cell systems, there is a therefore a need to employ powerful 
secretory signals to drive the secretion of the expressed 
recombinant proteins out of the heterologous host cells for 

15 easy detection, quantification and large-scale purification. 

For an efficient expression system, several criteria 
need to be satisfied. First, the recombinant protein 
synthesis must be functional. Second, a reasonable yield of 
the protein is required to justify its usage. Third, the 

20 system must enable homogenous synthesis of the recombinant 
protein. Last, the system should allow easy scaling-up and 
subsequent downstream processing, for example, purification. 
Consequently, we have constructed recombinant fusion vectors 
that direct the secretion of heterologous proteins into the 

25 medium used to culture various host cells. 

An important aspect of the study of gene regulation is 
assay of transcription levels observed from wild-type and 
mutant versions of putative cis-acting control elements in 
transfected mammalian cells. In most cases, the ability of 

30 the cis-acting elements to modulate transcription is not 
assayed directly. Instead, the cis-acting elements are 
joined to a reporter gene that codes for a measurable enzyme 
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activity. The level of the enzymatic activity that 
accumulates during the course of an experiment is taken as a 
measure of the ability of the cis-acting element to regulate 
transcription. Although this is an indirect assay of the 
5 activity of the cis-acting elements that control 
transcription, it is often the only one that is practicable. 
A number of different genes have been used as reporters of 
the transcriptional activity of eukaryotic promoters. These 
include chloramphenicol acetyltransf erase (CAT) , p- 
10 galactosidase and lucif erase. However, all these require 
lysis of the transfected cells and thus are subject to the 
problems associated with set-up and variability of multiple 
cultures . 

To date, the Great EscAPE SEAP Genetic Reporter 
15 System™ (Clontech) is one available system that utilizes a 
secreted form of an enzyme, human placental alkaline 
phosphatase, as a reporter for the analysis of cis-acting 
DNA sequences and trans-acting factors. However, a drawback 
of the SEAP assay is that it has to be measured using either 
20 a fluorescent or chemiluminescent assay, requiring a 
fluorimeter or a luminometer. Furthermore, it requires the 
investigator to switch from their established protocols for 
CAT, p-galactosidase or lucif erase. 

SUMMARY OF THE INVENTION 

25 The present invention stems from the finding that the 

secretory signal sequence (SS or Vtgss) of the vitellogenin 
gene, especially the vitellogenin gene from Oreochromis 
aureus, can be used to direct the secretion of recombinant 
proteins out of cells. The cells that are able to utilize 

30 the O. aureus SS are not particularly limited;- the present 
inventors have found that cells of many kinds, especially 
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eukaryotic cells, will efficiently process exogenous fusion 
proteins comprising the SS of the invention and secrete them 
from the cell. 

Thus, one aspect of the present invention is isolated 
5 nucleic acids that encode a SS that provides the biological 
activities of directing secretion of a protein linked to the 
SS from a host cell and cleavage of the SS from the fusion 
protein at a particular point in the SS sequence. The 
isolated nucleic acids of the invention are easily 
10 incorporated into vectors for expression of recombinant 
proteins and subsequent secretion of the fusion protein, 
processed by appropriate cleavage of the SS portion, into 
the medium used to culture the host cells. An advantage of 
the present invention is that the secretion is very 
15 efficient regardless of the particular protein secreted or 
the host cell used to express the recombinant protein. 

Thus, the present invention resides in part in the 
development of an efficient and versatile vector for high 
level expression and immediate secretion of recombinant 
20 proteins into serum-free medium used to culture the host 
cells. The host cells can be immobilized, or else easily 
removed from the culture, thus providing easy purification 
of the recombinant proteins. Also, vectors comprising the SS 
of the invention and a reporter gene will express and 
25 efficiently secrete the reporter proteins, thus enabling 
easy detection of the reporter protein, for example by 
sampling of the culture medium, or of an extracellular fluid 
in the instance of an in vivo assay. 

As a result of the efficient processing conferred by SS 
30 of the present invention, the invention provides assays of 
gene expression levels by measuring the amount of a reporter 
gene that is secreted into medium used to culture cells 
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harboring vectors of the invention. An advantage of such an 
assay is that cells expressing the reporter gene need not be 
lysed to perform the assay, allowing the assay to be used 
for screening of clones for expression of recombinant 
constructs or that are co-transformed with additional 
vectors . 

The invention also encompasses biosensors comprising 
cells that are transformed with vectors according to the 
present invention. Such biosensors can be applied to 
detection of compounds present in samples by addition of the 
samples to media used to culture cells of the invention, 
followed by measurement of the activity of reporter proteins 
in the culture media whose expression is induced or 
repressed by the introduction of the compound into the 
medium. 

The present invention can also be applied to the 
expression of desired proteins in transgenic or chimeric 
organisms in vivo. In some instances, it is desirable to 
obtain secretion of a protein into the extracellular spaces 
in an organism, including extracellular matrix, various 
secretory ducts, peritoneal fluid, ventricular fluid, the 
bloodstream or the lymphatic system. The present invention 
can be applied to directing secretion of a desired protein 
into any of these extracellular spaces. 

Control of expression of a desired gene, followed by 
secretion of the gene product from the cell, for example in 
a tissue-specific manner, or in a manner inducible by 
treatment with a particular chemical compound, can be 
achieved by linkage of an appropriate promoter to a 
structural gene comprising nucleotides encoding the SS of 
the invention linked to a nucleotide sequence encoding the 
desired gene product. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the construction of pAc5/VtgCrFCES-V5- 
His. The full-length CrFC21 cDNA encoding Factor C was 
released by a Bgl II and Not I double digest. The released 
5 fragment was subcloned into the Bam HI and Not I sites of 
VtgEGFP. Consequently, the Bgl 1 1 /Bam HI site is destroyed. 
Then the LPS-binding domain of CrFC21 was released by 
digestion with Eco47 III and Sal I (cutting an internal 
site) . The fragment was subcloned into the Eco RV and Xho I 



10 sites of pAc5/V5-HisA to give pAc5/VtgCrFCES-V5-His . Both 
Eco47 III/Eco RV and Sal I /Xho I sites are destroyed.^^ 



Figure 2A shows the amino acid, and nucleotide sequences/ 

\ 1 
at the junction of Vtgss and CrFCES. The nucleotide 

junctions were determined by sequencing using the Ac5 

15 forward primer and pcDNA3.1/BGH reverse primer. The CrFCES 

is cloned in-frame with respect to Vtgss (at the 5' end) and 

V5-His (at the 3' end) . The secreted VtgCrFCES protein was 

purified by affinity chromatography (TALON™, Clontech) under 

denaturing conditions ( 6M urea, 250 mM NaCl and 20 mM sodium 

20 phosphate buffer, pH7.0). The protein was transferred to a 

PVDF membrane in transfer buffer (10 % methanol, 10 mM CAPS, 

pH 11.0) and its N-terminal amino acid sequence determined. 

Using the novel Vtgss as secretory signal, only a single 

cleavage point was identified (indicated by the arrow) . 

25 Thus, Vtgss allows the homogenous production of secreted 

heterologous recombinant protein. 

Figure 2B shows a map of the plasmid pAc5 /VtgCrFCES-V5- 

His . 





30 



Figures 3A and 3B show the distribution of secreted 
VtgCrFCES protein . 

Figure 3A - culture medium 



Lane M: Benchmark prestained marker 
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Lane 1: control medium (30 pg) 

Lane 2: VtgCrFCES medium (not purified 30 pg) 
Lane 3: VtgCrFCES (affinity purified; 1 pg) 
Lane 4: VtgCrFCES (ISOprime™ purified; 1 pg) 
5 Figure 3B - cell lysate; 

Lane M: Bio-Rad prestained marker 
Lane 1: control cell lysate (30 pg) 
Lane 2: VtgCrFCES cell lysate (transient; 30 pg) 
Lane 3: VtgCrFCES cell lysate (stable; 30 pg) 
10 The VtgCrFCES (hereinafter "VtgCrFCES" ) protein was 

effectively secreted into the culture medium, as verified by 
SuperSignal HisProbe™ Western Blotting Kit (Pierce) . The 
secreted VtgCrFCES was purified to homogeneity by 
isoelectric focusing (ISOprime™, Hoefer) resulting in a 
15 single protein band having a molecular weight identical to 
that of the protein isolated by affinity column 
chromatography. The ISOprime™-purif ied protein was not 
denatured even when purified in pyrogen-free water. No 
VtgCrFCES was detected in the cell lysate. The purification 
20 process was made easy and more effective by the presence of 
VtgCrFCES in the culture medium. 

Figure 4 shows the lipid A binding properties of 
ISOprime™-purif ied VtgCrFCES. Immobilization of lipid A to 
the HPA sensor chip has been described earlier (Pui et al., 
25 J. Endotoxin Research 4:391-400 (1997)). 

Injection of VtgCrFCES ( 8 OOng/lOOpl ) resulted in a 
signal of -2000 relative response units. This represents a 
-92% saturation of lipid A which clearly indicates that 
VtgCrFCES binds to lipid A. Subsequently, injection of 
30 antibody (INDIA His-HRP Ab) against the poly-His tag of 
VtgCrFCES resulted in a further increase in the signal. The 
binding of INDIA His-HRP Ab further confirms that only 
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VtgCrFCES was bound to the immobilized lipid A. The result 
also indicates that the six additional Vtgss-derived amino 
acids in the N-terminus of the protein do not interfere with 
the proper folding of the recombinant protein and its LPS- 
5 binding property. The properly folded VtgCrFCES is secreted. 
Injections of samples at various steps are marked on the 
sensogram. "Wash" indicates a washing step using pyrogen- 
free water. The relative response units of the signal are 
obtained by subtracting the response units recorded just 
10 before injection of sample from the response units recorded 
after injection of sample and a 2 min wash. 

Figure 5A shows the construction of psp-VtgCAT. The CAT 
gene was isolated by PCR. The start ATG codon of the CAT 
gene was changed to CTG to ensure translation initiation at 
15 Vtgss as indipatecj , v in bold in the sequence of the CAT 



forward primer (5'-GAA GAT CTG CTG GAG AAA AAA ATC ACT GG- 



the CAT reverse ^Yimer (5 ' -GC ATC GGC CGT GCC TTA AAA AAA 



20 cycle (94 °C/5 min; 50 °C/1 min; 72 °C/1 min), 29 cycles (94 
°C/45 sec; 50 °C/30 sec; 72 °C/30 sec) and final extension at 
72 °C for 5 min. The PCR product of -675 bp was purified 
using a Qiaquick™ PCR Purification Kit and subcloned into 
the Hinc II site of pBluescript™ II SK. The resultant 

25 construct is named pBSCAT. The CAT gene was released by Bgl 
II and Not I digestion and subcloned into compatible Bam HI 
and Not I sites of pVtgEGFP. The CAT gene harboring Vtgss 
(VtgCAT) was subsequently released by using Eco47 III and 
Not I digest and recloned into the Eco RV and Not I sites of 

30 pBluescript™ II SK to form the vector pBSVtgCAT . This 
recloning step serves two purposes: (1) the sequences 
between Vtgss and CAT can be determined using a T7 primer; 




gene is indicated in bold in 



3 1 ) . The stop 




TTA CGC-3'). The temperature profile used includes a 1 



St 



Attorney Docket 



1781-178P 



-10- 



(2) this creates a Hind III site 5' of VtgCAT and a Cla I 
site 3' of VtgCAT for future subcloning into the pSEAP- 
Promoter vector^-^;/^ 

Figure SB^shows the details of the Vtgss-CAT fusion in 
5 the pBSVtgCAT vector. 

Figure 5C shows the plasmid map of the pSEAP-Promoter 

vector • gfflSb /to*-' 1% 1^?) 

Figure SD^shows the amino acid and nucleotide sequences 
adjoining Vtgss and CAT in the vector psp-VtgCAT. The 

10 sequences between Vtgss and CAT were verified by Taqtrack™ 
sequencing (Promega) using a T7 primer. Both EcoRV/Eco47 III 
and Bgl I I /Bam HI sites were destroyed. 

Figure 5E shows the detailed plasmid map of psp-VtgCAT. 
The VtgCAT gene was subcloned downstream of the SV40 early 

15 promoter. The unique restriction enzymes in the multiple 
cloning site (MCS) were as illustrated. This construct can 
be used to analyze enhancer sequences cloned into any one of 
the unique sites in the vector. 

Figure 5F shows the plasmid map of ERU-psp-VtgCAT . The 

20 estrogen response unit (ERU) of 565 bp from the Xenopus 
vitellogenin Bl gene was subcloned into the Nhe I and Bgl II 
site of psp-VtgCAT. The Nhe I/Xba I site was destroyed. The 
transcription of VtgCAT is under the influence of estrogen 
in this construct. 

25 Figure 5G shows restriction digests of various 

constructs . 

Lane 1: 100 bp marker 

Lane 2: ERU-psp-VtgCAT/Hind III and Cla I 
Lane 3: psp-VtgCAT/Hind III and Cla I 
30 Lane 4: pSEAP-Promoter /Hind III and Cla I 

Lane 5: cER in pSGl/Hind III 
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The VtgCAT (-830 bp) fragment can be released by Hind 
III and Cla digestion. 

The upper band for ERU-psp VtgCAT is slightly larger 
than psp-VtgCAT because it contains a -560 bp ERU. 
5 Figure 6 shows the secreted VtgCAT expression profile 

over time in COS-1 cells. The COS-1 cells were cotransf ected 
with ERU-psp- VtgCAT, pSG cER (chicken estrogen receptor 
expression vector) and pSEAP-Control in a ratio of 6:3:1. 
The result illustrates several important features. First, 

10 the recombinant VtgCAT was effectively secreted and 
accumulated in the culture medium. Second, the secretion of 
VtgCAT was not delayed. The uninduced cells (open box) 
exhibited only a marginal increase in VtgCAT over the period 
of 24 h. For induced cells (solid box), the increase in the 

15 production and secretion of VtgCAT can be detected as early 
as 2 h after the E 2 -induction. A peak 7-fold increase (-1400 
]Lig VtgCAT/U SEAP activity/h) in VtgCAT was observed 12 h 
post-induction as compared to the corresponding control. By 
24 h, the VtgCAT decreased to -1000 |ig VtgCAT/U SEAP 

20 activity/h. The secreted VtgCAT was assayed using CAT ELISA 
(Boehringer Mannheim) SEAP was measured as described in Tan 
et al., Mol. Cell. Endocrinol. 123 :146-161 (1996)). The 
VtgCAT amount was normalized using SEAP activity. lxlO" 8 M or 
estradiol was used for induction in all experiments. 

25 Figure 7 shows a Northern blot analysis of E 2 -induced 

VtgCAT expression for ERU-psp-VtgCAT . Total RNA was isolated 
from the COS-1 cells for E 2 -induced and uninduced cells using 
RNAeasy™ Mini Kit (Qiagen) . Ten \iq of RNA was resolved on a 
1.2 % f ormaldehyde/agarose gel and alkaline transferred onto 

30 Hybond-N+™ membrane (Amersham) . The membrane was probed 
initially with VtgCAT, stripped and reprobed with a mouse 
actin gene. The result indicates that the levels of VtgCAT 
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in the culture medium are directly proportional to changes 
in intracellular concentration of VtgCAT mRNA. Thus, Vtgss 
can be utilized for reporter gene studies since there is no 
delay in its production and secretion. An actin cDNA probe 
5 was used to normalize the result. 

Lane 1 : Oh 

Lane 2 : 2 h 

Lane 3 : 4 h 

Lane 4 : 6 h 
10 Lane 5: 12 h 

Lane 6: 24 h 

Figure 8A shows the construction of pVtgEGFP. The Vtgss 
was isolated by PCR using . a vector T3 primer and a 
OaVtgExon2 reverse primer^ (5 -CCAAGTTGGACTGGTCCCCCA-S ' ) 

15 using pOaVtg 1 as a template. The PCR conditions used 
includes a 1 st cycle (94 °C/5min; 52 °C/1 min; 72 °C/1 min) , 
29 cycles (94 °C/lmin; 52 °C/30 sec; 72 °C/30 sec) and final 
extension of 72 °C 5 min. The PCR product was purified using 
Qiaguick™ Nucleotide Purification Kit (Qiagen) and digested 

20 with Sac I. This blunt-end (from the PCR) and Sac I fragment 
was subcloned into the Sma I and Sac I sites of pEGFP-Nl 
(Clontech) . Consequently, the Sma I site of pEGFP-Nl was 
destroyed. To reduce the distance between the 
transcriptional start site and the promoter, the clone was 

25 digested with Eco RI and Bgl II, Klenow end-filled, and 
religated to give ^ 
Figure 8B ^sfioi 
the vector pVtgEGFP. 

Figure 8C shows the detailed plasmid map of pVtgEGFP. 

30 The VtgEGFP was subcloned downstream of the CMV promoter. 
Several unique sites flank both the Vtgss and EGFP to allow 
more flexibility in cloning. The sequence junction of Vtgss 
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EGFP was determined using Taqtrack™ sequencing and an EGFP 
reverse priiteiP^^^S' -CCCTCGCCGGACACGCTGA-3 ' ) . The 

transcriptional start site was determined by Teo et al . ("A 
Novel Piscine Vitellogenin Gene: Structural and Functional 
Analyses of Estrogen-Inducible Promoter", <J. Mol. Cell. 
Endocrin. (1998), in press). 

Figure 8D shows western blot analysis of VtgEGFP 
expression in COS-1 cells. Transient transfection of COS-1 
cells with pVtgEGFP was performed using Lipof ectamine™ 
(Gibco) . Forty-eight hours post transfection, both the 
medium and cell lysate were sampled. The result indicates 
that majority of VtgEGFP was secreted into the culture 
medium. A weaker immunoreactive band was also identified in 
the cell lysate using GFP antibodies. This is most likely 
due to the translation initiation at the start ATG codon of 
native EGFP. This shows that Vtgss can direct secretion of 
another reporter gene, EGFP. However, the optimal result for 
Vtgss-directed secretion can be achieved if the start codon 
of the EGFP gene is replaced or removed so that translation 
begins at the start codon of the Vtgss. 

Lane M: Bio-Rad Kaleidoscope™ marker 
Lane 1: control culture medium (24 h, 30 \xq) 
Lane 2: VtgEGFP culture medium (24 h, 30 ]ig) 
Lane 3: VtgEGFP culture medium (48 h, 30 pg) 
Figure 9 shows the secreted VtgCAT expression profile 
over time in NIH/3T3 (normal mouse fibroblast) and CHO-B 
(Chinese hamster ovary) cells. NIH/3T3 and CHO-B cells were 
cotransf ected with ERU-psp-VtgCAT, pSG cER (chicken estrogen 
receptor expression vector) and pSEAP-Control in a ratio of 
6:3:1 using Lipof ectamine™ (Gibco). The result shows that 
recombinant VtgCAT was effectively secreted and accumulated 
in the culture medium. Thus, secretion of VtgCAT was not 
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limited to COS-1 cells; estrogen-induced expression of ERU- 
psp'-VtgCAT can also be observed in other mammalian cells. 
The uninduced cells (open box) exhibited only a marginal 
increase in VtgCAT over the period of 2 4 h. For induced 
5 cells (solid box), the increase in the production and 
secretion of VtgCAT can be detected in the earliest 
indicated time point of 6 h after the E 2 -induct ion . A peak 
showing 4-fold (-400 jag VtgCAT/U SEAP activity/h) and 5.5- 
fold (-550 jag VtgCAT/U SEAP activity/h) increase in VtgCAT 

10 were observed for NIH/3T3 and CHO-B cells, respectively. In 
contrast to COS-1, the lower level of induction is due to 
the lack of the large T-antigen which would otherwise 
amplify the signal. Importantly, the results indicate that 
the secretion of VtgCAT is not limited to a particular cell 

15 type. The secreted VtgCAT was assayed using CAT ELISA 
(Boehringer Mannheim) . SEAP was measured as described by Tan 
et al. Mol. Cell. Endocrinol. 123:146-161 (1996)). The 
VtgCAT amount was normalized using SEAP activity. 

Figure 10 shows the secreted VtgCAT expression profile 

20 over time in EPC (carp epithelial cells), a piscine cell 
line. (Open box, uninduced; solid box, estrogen-induced 
cultures.) A detectable amount of VtgCAT can be observed at 
6 h. The EPC cells were cultured at a low temperature of 25 
°C. A 3-fold increase in VtgCAT can be detected by 24 h after 

25 estrogen-induction. The efficient production and secretion 
of VtgCAT was again demonstrated in this piscine cell line. 
The low metabolic rate of EPC would conceivably produce less 
VtgCAT, but this does not affect the secretion efficiency 
conferred by the SS of the invention. The secreted VtgCAT 

30 was assayed using CAT ELISA (Boehringer Mannheim) . SEAP was 
measured as described in Tan et al . , Mol. Cell. Endocrinol. , 
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123:146-161 (1996)). The VtgCAT amount was normalized using 
SEAP activity. 

Figure 11 shows thei construction of pYEX-VtgCAT. The 
pYEX- VtgCAT vector basically used the backbone from pYEX-Sl 
(Clontech) except that the original K. lactis killer toxin 
signal sequence was replaced by Vtgss. Due to the limited 
number of cloning sites in the original pYEX-Sl, an 
intermediate vector pBSPGK was constructed. pBSPGK consists 
of the PGK promoter, released via Hind III and Eco RI 
digestion from pYEX-Sl, and subcloned into the corresponding 
sites of pBluescript™ II SK. This construct is a very useful 
intermediate as it provides more unique sites for cloning 
genes of interest. The VtgCAT fragment was released from 
pVtgCAT by Eco47 III and Eag I digestion and subcloned into 
the Sma I and Eag I sites of pBSPGK, respectively. The 
entire fragment harboring the PGK promoter and VtgCAT was 
released by Hind III and Sac I digestion and subcloned into 
the similar site of pYEX-Sl. Consequently, expression of 
pYEX- VtgCAT was driven by the strong constitutive PGK 
promoter and transcription terminated by the PGK terminator. 

Figure 12 shows the expression profile of VtgCAT in two 
different yeast transf ormants over 72 hours. The construct 
pYEX- VtgCAT was transformed Into S. cerevislae strain DY150 
using the single-step transformation method described by 
Chen et al . , Curr. Genet. 21:83-84 (1992)). The 
transf ormants were selected on synthetic minimal medium agar 
containing all the required supplements except uracil. 100 
ml YEPD medium (pH 5.0) contained in a 500 ml baffled flask 
was inoculated with a single colony of pYEX- VtgCAT and grown 
for 16 h at 30 °C with vigorous shaking. Subsequently, 2 X 50 
ml of the yeast were collected independently by 
centrifugation for 10 min at 800 x g. One 50 ml pellet was 
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resuspended in 200 ml YEPD medium and grown for 72 h at 30 °C 
with vigorous shaking in a 1 L baffled flask. The other 50 
ml pellet was resuspended in 200 ml minimal medium (MM) and 
grown as above. At 24, 48, and 72 h, two ml aliquots of the 
culture were removed. The yeast and medium were separated by 
centrifugation. The cell lysate was obtained by lysing the 
yeast pellet (resuspended in phosphate buffered saline) with 
glass beads, while the culture medium was collected and 
frozen without any pre-treatment . The pH of the culture was 
also monitored using appropriate universal pH indicator 
paper and adjusted to pH 5.0 using 1 M potassium phosphate 
buffer (pH 8.0). The amount of VtgCAT secreted into the 
culture medium and in the cell lysate were assayed using CAT 
ELISA (Boehringer Mannheim) . Total soluble protein was used 
to normalize the data. 

Fiqure 13 shows the construction of pBADVtgblactKana. 
The ft-lactamase gene was isolated by PCR usxng bl^tgori (5 - 
CCGGGATCCAGAAACGCTGGTGAAAGTAA-3' ) and blactr^iiTmer^ ( 5 ' - 
GCGGCCGTTACCAATGCTTAATCAGTGAG- 3 ' ) using pBluescript™ II SK 
as a template. The PCR primers were designed to exclude the 
native il-lactamase secretory signal. The PCR conditions used 
include a first cycle (94°C/45 sec; 50°C/30 sec; 72°C/1 
min) , 29 cycles (94°C/45 sec; 50°C/1 min; 72°C/30 sec) and 
final extension (72°C/5 min) . The approximately 790 basepair 
PCR product was purified using a Qiaquick™ PCR Purification 
Kit (Qiagen) and digested with Bam HI and Eag I. This BamHI- 
Eagl PCR fragment was subcloned into the BamHI and Not I 
sites of pVtgEGFP. To maintain optimal distance between the 
promoter and the ATG start codon, a second round of PCR was 
performed using Bsp^sTor^?' -GGGTCATGAGGGTGCTTGTACTAGCTCTT- 
3') and blactrev primers and using pVtgblact as template. 
The second PCR follows a thermal regime of a first cycle 
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(94°C/5 min; 60°C/1 min; 72°C/1 min) , 29 cycles (94°C/45 
sec; 60°C/30 sec; 72°C/30 sec) and a final extension of 
72°C/5 min. The resulting PCR fragment of about 865 

TM 

basepairs was purified as above and cloned into Bluescript 
II SK. This Vtg-fi-lactamase fragment was released by BspHI 
and EcoRI digestion and subsequently subcloned into the Ncol 
and EcoRI sites of pBAD/myc-His B. The vector's li-lactamase 
gene was released by BspHI digestion, and the vector ends 
were blunt-ended by Klenow enzyme. Then a kanamycin 
resistance gene obtained as a 2461 basepair Dra I fragment 
from pGFP-N3 -was inserted into the blunt-ended vector. The 
resulting plasmid, named pBADVtgblactKana was transformed 
into E. coll. LM194 competent cells. (0$J&fa\1^ (0j^k[ 

Figure 14A shows the amino acid^ and nucleotide^ 
sequences at the junction of Vtgss and fi-lactamase in 
pBADVtgblactKana . 

Figure 14B shows the plasmid map of pBADVtgblactKana. 
The Vtg-fi-lactamase gene was cloned downstream of the araBAD 
promoter (pBAD) . The translation initiation site of Vtgss 
was constructed so that it is 9 basepairs from the optimized 
ribosome-binding site. The sequence junction of Vtg-B- 
lactamase was determined using Taqtrack™ sequencing. Both 
the BspHI site of Vtg-ft-lactamase and Nco I site of the 
pBAD/myc-His B vector were destroyed. 

Figure 15 shows a plating assay of Vtg-fi-lactamase 
induction. In the absence of inducer or at very low levels 
of inducer (arabinose) , no colonies are observed. Increasing 
amounts of inducer provides for survival of increasing 
numbers of colonies. Thus, when a selectable marker gene is 
used as a reporter gene," the plating assay provides an easy 
semi-quantitative assay for reporter gene activity. 
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Figure 16 shows the growth profile of the bacteria 
after induction with arabinose. The growth profile was 
monitored at O.D. 60 onm. It is clear from the graph that a 
pBADVtgblactKana clone grown in RM medium (■) with 0.2% 
glucose (i.e., no induction) exhibited a growth profile 
similar to the control LM194 host bacteria (•) . Addition of 
arabinose, at various concentrations, affected the growth of 
the pBADVtgblactKana clone. Best growth is seen only when 
0.0002% arabinose was used (A). Higher concentrations of 
arabinose (> 0.02%) resulted in a decrease in cell density. 
This is probably due to either (1) compromised growth of the 
cells due to overexpression of Vtg-fi-lactamase or (2) 
toxicity due to a high level of Vtg-fi-lactamase activity. 
The decrease in cell density has a pronounced effect on 
secretion of Btg-fl-lactamase . The induction procedure was as 
described by the manufacturer (InVitrogen) . 

Figure 17 shows the expression profile of Vtg-fi- 
lactamase in the periplasmic space of bacteria. From the 
graph, no Vtg-ft-lactamase was produced when a 
pBADVtgblactKana clone was cultured in RM medium with 0.2% 
glucose (•) . This is expected since no inducer is added. 
Addition of arabinose resulted in expression and 
accumulation of Vtg-fi-lactamase in the periplasmic space. 
The early expression profile is dose-dependent, with the 
highest expression observed when 0.2% arabinose was used 
(♦). However, by 4 h, this dose-dependent profile is not 
observed. This could be directly due to the growth 
retardation as seen in Figure 15. Importantly, the 
recombinant Vtg-fl-lactamase is functional and exhibited 
properties similar to native ft-lactamase when measured using 
the standard colorimetric assay for ft-lactamase (Cohenford 
et al., Anal. Biochem., 168, 252-258 (1988)). The 
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periplasmic space fraction was isolated as described by 
Laforet and Kendall (J. Biol. Chem. , 266, 1326-1334 (1991)). 
The values obtained from the various time points for 
pBADVtgblactKana were subtracted from those of the 
5 corresponding LM194 controls. 

Figure 18 shows the expression profile of Vtg-B- 
lactamase in culture medium. From the graph, no Vtg-fi- 
lactamase can be detected when a pBADVtgblactKana clone was 
grown in 0.2% glucose (•) . Again, this is expected since no 
10 inducer is present. Surprisingly, Vtg-fl-lactamase was 
detected in the culture medium, although not at as high a 
level as in the periplasmic space. But Vtg-ft-lactamase in 
the culture medium was measured in the dilute medium. Even 
more surprising is that the Vtg-fl-lactamase expression 
15 profile is reversed as compared with the Vtg-ft-lactamase 
accumulation in the periplasm. The highest concentration of 
Vtg-ft-lactamase was detected in the medium when the lowest 
concentration of 0.0002% arabinose was used in the culturing 
(■) . Obviously, the ratio of induction and growth profile of 
20 bacteria are important factors. Rapid and high accumulation 
of Vtg-B-lactamase in the periplasmic space does not 
translate into efficient secretion into the medium. Instead, 
low-level induction is preferred. The medium was clarified 
of bacteria by centrif ligation- at 4°C for' 10 min at 12,000 
25 xg . The filtered medium was kept in ice until all thel 
samples were collected for assay. 

Figure 19 shows the cloning strategy of pVtg-Gal . The 
ft-galactosidaae coding sequences was PCR amplified using 



:osidase . cp 



ForBamGal^ ( 5 ' -CCATGGATCCCGTGATTTCGTTGCCGGTCT-3 ' ) and 
30 RevEagGaJ^^f5^^CGACGGCCGGGCAGACATGGCCTGC-3' ) using pfigal- 
promoter (Clontech) as template. The PCR product was digest 
with Ban HI and Eag I and ligated into similar sites in 
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pVtgblact . 



Using pBluescript II SK as an intermediate 



vector, a -1.3 kb fragment harboring Vtgss and the 5' coding 
region of fi-galactosidase was subcloned into pfigal-promoter 
at the Hind III and Eco RV sites. This vector expressing 
secreted ft-galactosidase was designated pVtg-Gal . 

Figure 20 shows western blot analysis of Vtg-Gal using 
mouse anti-ft-galactosidase . Fifty ug of medium was loaded 
and electrophoresed on a 10% SDS-PAGE gel. Lane 1: Day 5 
medium; Lane 2 : Day 4 medium; Lane 3 : Day 3 medium; Lane 
4: Day 2 medium; Lane 5: control medium; Lane 6: Blank; 
Lane 7: 20 ug of cell lysate from day 5 culture. The 
western blot was developed using Goat anti-mouse -HRP and 
Chemiluminescence Substrate . 

Figure 21 shows the rate of secretion of VtgCAT and 
Vtg-Gal in comparison with SEAP. COS-1 cells were 

transfected with 1 |ag of Vtg-fusion construct: control 
vectors in a ratio of 8:2 by liposome-mediated transfection 
with 5-6 jj.1 of Lipof ectamine® reagent (Gibco-BRL) and 
incubated at 37°C for 5 h as described by the manufacturer. 
After 36 h, the medium from cells of each time point was 
removed and replaced with 1 ml of fresh medium at an 
interval of 15 min over a period of 2 h. After the last 
time point, which should represent 0 min, an additional 1 h 
incubation was employed for all transf ections to avoid low 
reading variations. At the end of incubation, the medium 
was retrieved and transferred to a microcentrifuge tube. 
Centrif ugation was carried out as 12, 000 x g for 30 sec to 
pellet any detached cells present in the culture medium. 
100 [il of supernatant was used for ELISA. SEAP was assayed 
using Great Escape™ SEAP Fluorescence Detection Kit 
(Clontech) as described by the manufacturer. Both SSCAT and 
SS-Gal were assayed using CAT ELISA and ft-Gal ELISA 
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50 



(Boehringer Mannheim), respectively. For better comparison, 
the CAT ELISA and fl-Gal ELISA were adapted for fluorescence 
using DIG Fluorescence Detection ELISA (Boehringer 
Mannheim) . The procedures were as described by the 

manufacturer. SEAP was determined using a fluorimeter 
(Perkin Elmer) at Exseo™ and EM 449nm . VtgCAT and Vtg-Gal were 
detected at Ex 440n m and EM 5 50nm- 

DETAILED DESCRIPTION OF THE INVENTION 

The present inventors have discovered that the portion 
of the amino acid sequence of vitellogenin that directs 
secretion of the protein from the cell can be linked to any 
other polypeptide and so direct secretion of that protein 
from a host cell. The secretory signal sequence of the 
invention, herein called "SS" or "Vtgss", is found in the 
amino-proximal portion of a vitellogenin protein. 

The preferred SS of the invention is obtained from a 
piscine vitellogenin protein, preferably from the fish 
Oreochromis aureus. There is a family of vitellogenin genes 
in this organism, which apparently comprises four subgroups 
of genes identifiable by different restriction maps of 
cloned cDNAs (J. L. Ding et al., Biochem. Intl. 20:843-852 
(1990)). Melting of heteroduplexes formed between the cDNAs 
suggests that the overall nucleotide sequence variation 
among them is about 7%. The 5' end of the cDNAs show strong 
homology across all four subgroups by Southern blotting (B. 
H. Lee et al . , Biochem. Mol. Biol. Intl. 34 :75 " 83 (1994)). 
The most preferred SS of the inventi $^^Jg|5 haVing thS 
amino acid sequence MRVLVLAL ^^^^8^ LC \' 

The sequence MRVLVLALAVALAVGDQSNLG^ can be modified by 
amino acid replacements and by insertions or deletions from 
the sequence. The SS may have additional individual amino 
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acids or amino acid sequences inserted into the polypeptide 
in the middle thereof and/or at the N-terminal and/or C- 
terminal ends thereof so long as the SS retains the 
biological activities of directing secretion of a fusion 
protein from a eukaryotic cell and cleavage of the secretory 
signal sequence from the fusion protein at a site between 
the G and D residues, or between any amino acids substituted 
for these positions. Likewise, some of the amino acids may be 
deleted from the SS or substituted, so long as the resulting 
amino acid sequence retains those biological activities. 
Amino acid substitutions may also be made in the sequences so 
long as the polypeptide possesses the desired biological 
activities. 

The secretion activity of the SS can be arranged by 
measuring the proportion of a reporter protein secreted into 
the medium of cultured recombinant cells expressing SS- 
reporter protein fusions. The cleavage activity can be 
assessed by N-terminal sequencing of the secreted protein. 

One or more amino acid residues within the sequence can 
be substituted with another amino acid of similar polarity 
which acts as a functional equivalent, resulting in an 
altered polypeptide that retains the desired biological 
activities. Amino acid substitutions are preferably 
"conservative amino acid substitutions". "Conservative amino 
acid substitutions" are substitutions of one amino acid by 
another amino acid wherein the charge and polarity of the two 
amino acids are not fundamentally different. Amino acids can 
be divided into the following four groups: (1) acidic amino 
acids, (2) neutral polar amino acids, (3) neutral non-polar 
amino acids and (4) basic amino acids. Conservative amino 
acid changes can be made by substituting one amino acid 
within a group by another amino acid within the same group. 
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Representative amino acids within these groups include, but 
are not limited to, (1) acidic amino acids such as aspartic 
acid and glutamic acid, (2) neutral polar amino acids such as 
glycine, serine, threonine, cysteine, tyrosine, asparagine 
5 and glutamine, (3) neutral non-polar amino acids such as 
alanine, valine, leucine, isoleucine, proline, phenylalanine, 
tryptophan and methionine, and (4) basic amino acids such as 
lysine, arginine and histidine. 

Proline is considered to be a less favored substituting 

10 amino acid with respect to the SS of the invention, due to 
its tendency to "break" alpha-helical secondary structure of 
peptides. Also, it is preferable to retain a basic amino 
acid, preferably lysine or arginine, at the second residue of 
the SS. Also, it is preferred that the G and D residues 

15 constituting the cleavage site in the SS be retained; any 
substitution of these two amino acids is preferably A or V 
for the G and E for the D. 

Any integral number of additions or deletions of amino 
acids between 0 and 4 can be made to the SS; preferably only 

20 1 or 2 additions or deletions are made. It is preferred that 
the stretch of hydrophobic amino acids in the interior of the 
SS be - kept in the range 10-15 amino acids long, preferably 
11-14, most preferably 11, 12 or 13 amino acids long. 

The number of substitutions that is tolerated is larger. 

25 An important aspect of the SS is that the hydrophobic stretch 
in its interior portion be retained. Many of the non-polar 
amino acids in this portion can be substituted with other 
non-polar amino acids, especially those having hydrocarbon 
side chains, without affecting the activity of the SS. 

30 Preferably fewer than 7 of the amino acids in the SS, more 
preferably fewer than 3, are substituted overall. 
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g^ie DNA enc oding the ftp r>£ — fefee — ariweirbiran — can oT course^ 


vary with respect to degeneracy of the genetic code. A 
preferred nucleotide sequence encoding the of the 

invention is att cac ate cac cag cc atg ag^tg ^ at|^a 
get ctt get gtg get etc gca gtg ggg gap^cag tec aac £tg ggg,- 
Also, in embodiments of the invention wherein DNA encoding a 
SS of the invention is joined to DNA encoding a desired 
protein to be expressed, ^the joining should be done so that 
the reading frame of^tf'he SS and of the desired protein are 



the same. The joining can be performed so that the 3' end of 
the DNA encoding the SS is joined directly to the 5' end of 
the DNA encoding the desired protein. Alternatively, the 3' 
end of trfie' DNA encoding the SS can be joined to a "linker" 
DNA tltfat encodes additional amino acids, which linker DNA is 
th^n joined also to the 5' end of the DNA encoding the 
"aesi red protei n . 

In some embodiments of the invention, the desired 
protein is a "reporter protein" that has an easily assayed 
biological activity. The biological activity of the reporter 
protein is preferably closely correlated with the amount of 
the transcript encoding the reporter protein present in the 
cell, and in the most preferred embodiment, reflects the 
transcription activity of the promoter driving expression of 
the reporter protein. Examples of reporter proteins for 
which cloned DNAs and quantitative assays are known are 
chloramphenicol aminotransferase, green fluorescent protein 
or another aequorin, p-amylase, p-lactamase, luciferase, 
glucuronidase, alkaline phosphatase and I3-galactosidase . 

A biosensor can be made by making a construct 
comprising the SS and a desired protein operatively linked 
to a promoter sensitive to a particular chemical compound. 
"Operative linkage" of the promoter occurs where the 
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transcription of the SS-desired protein cassette is driven 
by the promoter in the fashion normal for the promoter. 
Thus, if the promoter is a constitutive one, then 
transcription is constant. If the promoter is chemically- 
5 inducible or suppressible , then the transcription of the SS- 
desired protein cassette is similarly chemically-inducible 
or suppressible. 

To complete the biosensor, a cell that intracellularly 
expresses or bears on its surface a receptor for the 

10 compound being sensed is transformed with the construct 
comprising the relevant promoter operatively linked to the 
SS-desired protein cassette. The cell surface receptor is 
preferably one that is endogenous to the cell, since then 
one can presume that the additional proteins required for 

15 transduction of the compound binding signal to the nucleus 
are present in the cell. In instances where the cell surface 
receptor is introduced by recombinant DNA methods, then the 
practitioner of ordinary skill will understand that a host 
cell that harbors the relevant signal transduction proteins 

20 should be used. 

Cells bearing receptors for estrogen and estrogen-like 
compounds ("estrogen mimics") are preferred cells for 
creating biosensors of the invention. The estrogen signal 
transduction pathway is among the better understood ones, 

25 and several cell types that support estrogen-induced 
transcription of exogenous genes introduced by 
transformation are known (Tan et al., "Temperature- 
Dependence of Estrogen Binding: Importance of a Subzone in 
the Ligand Binding Domain of a Novel Piscine Estrogen 

30 Receptor"; Dana et al . , "Novel Estrogen Response Elements 
Identified by Genetic Selection in Yeast are Differentially 
Responsive to Estrogens and Antiest rogens in Mammalian 
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Cells") . Also, several cloned estrogen-responsive elements 
{"EREs") are known in the art (see, for example, N.S. Tan et 
al., Mol. Cell. Endocrinol. 123 :149-161 (1996)). 

The biosensors according to the invention allow easy 
assay for the presence of the relevant compound in a sample. 
The assay is performed by culturing cells constituting the 
biosensor in a medium. A sample of the medium is taken and 
assayed for the activity of the reporter protein encoded by 
the cassette to obtain a baseline level. Then the sample is 
introduced into the culture medium and the culture is 
continued. The medium is again sampled and the activity of 
the reporter gene measured. Changes in the activity of the 
reporter gene will reflect the presence of the relevant 
compound in the sample. Of course, alternate embodiments, 
such as separate, matched cultures of the biosensor cells in 
the presence and absence of the sample, can be envisioned. 
The particular media used are of course dictated by the 
cells used to construct the biosensor; media and methods of 
cell culture are generally known in the art. 

In a similar manner, the SS of the invention can be 
used to make expression cassettes for any desired protein. 
The protein will be secreted into the medium of the culture. 
Use of serum-free culture media provides conditions where it 
is possible that the desired protein will be one of only a 
few, and possibly the only, polypeptide present in the 
culture medium. This greatly simplifies purification of the 
desired protein. 

tft — other — embodiments — — the invpnt i nn/ it io docirablo - 
to produce a lipopolysaccharijd^iJiriding protein, such as 
Factor C of a horse^J^crab. Cloned cDNAs encoding Factor C 
from at 1 ga-*rf two species of horseshoe crab are known (see, 
U^S^" Patent 5, 716, 834 and Muta et al., J. Biol. Chem. 
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266:6554-6561 (1991)). The lipopolysacch^Tdl^binding 
protein produced by the present inventip^^can be used in the 
ways described in U.S. Patents ^^Kl r \A^ and 5,716, 834 and 
in co-pending applications 0^96, 4 05 and 09/081,767. In 
those embodiments of th^^invention directed to production of 
a protein for qualitative assay or purification, it is most 
desired that crells that do not secrete proteases are used. 
Preferred l/^st cells are COS, CHO, NIH/3T3, Drosophila 
cells, /especially Schneider 2 cells, piscine epithelial 
ceIVs (EPC) and yeast cells, such as S. cerevisiae and S. 
p^mbe and Pichia spp., especially protease-def icient yeast 
cells - . 

The present invention can also be used in an assay for 
heterologous gene expression. In this embodiment of the 
invention, cells are transformed with a vector that directs 
the expression of the heterologous protein to be expressed. 
The nucleic acid encoding the heterologous protein is joined 
to nucleic acid encoding the SS, thereby obtaining secretion 
of the expressed protein into the medium upon culture of the 
host cells. The presence of the heterologous protein in the 
culture medium can then be detected, for example by assay 
for its biological activity, if such is known, or by an 
immunologic method, if an antibody that specifically binds 
the heterologous gene product is available. Preferred 
methods for immunologic detection are ELISA (in any of the 
various known formats) and Western blotting. Labels for 
detection of antibody binding are not limiting of the 
method, but preferably the antibody is f luorescently labeled 
or enzymatically labeled. 

The SS of the invention can also be applied to 
"enhancer trap"-type vectors, wherein a promoterless vector 
carrying a reporter gene is transformed into a cell, where 
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it integrates at a random point in the genome in such a 
fashion that integration near a promoter results in 
expression of the reporter gene . In this embodiment, the 
reporter gene is made to incorporate the SS, thus expression 
of the reporter gene will result in secretion of the 
reporter protein out of the cell into the culture medium, 
thus simplifying screening for insertion in the vicinity of 
a promoter. 

The SS of the invention can also be applied to 
expression of genes in chimeric or transgenic animals, to 
obtain secretion of a desired protein, especially of a 
therapeutic protein, into an extracellular compartment. In 
some embodiments, this method provides general dissemination 
of the protein throughout the animal. In preferred 
embodiments, the protein is secreted into extracellular 
matrix, which can thus result in local dissemination if the 
half-life of the protein is short or if it is made to bind 
to receptors on the surface of cells or to receptors in the 
matrix. Secretion into the extracellular matrix can also 
provide quite extensive dissemination if the protein has a 
long half-life and is able to diffuse through the matrix. 

In other preferred embodiments, the protein is made to 
be secreted into the bloodstream, the lymphatic system or 
into the ducts of the breasts for secretion in milk by 
transformation of the appropriate cells. 

Example 1: Cloning of Vtgss in LPS-binding domain of CrFC 
gene (CrFCES) for expression and secretion of 
recombinant ES protein from Drosophlla cells 

The O. aureus vitellogenin gene secretory signal 
(Vtgss) was fused upstream to the EcoRI-Sall cDNA fragment 
encoding the LPS-binding domain of Carcinoscorpius 



Attorney DockeJ^^p. 1781-178P 



-29- 

rotundicauda Factor C CrFCES (Ding et al . , 1995. Mol. Marine 
Biol. & Biotechnology -4:90-103). The construct was inserted 
into the pAc5/V5-His vector backbone (InVitrogen, see Fig. 
1) . 

5 Vitellogenin is the yolk protein of the oviparous and 

ovoviviparous animals. It is exclusively synthesized in the 
liver, secreted and finally deposited in the eggs. The 
secretion of vitellogenin is directed by its secretory 
signal. The expression of the LPS-binding domain of Factor C 

10 (-333 amino acids) is challenging because it is rich in 
cysteine residues and requires the correct formation of 9 
disulfide bonds for full biological activity. The 
recombinant protein ( VtgssCrFCES ) is synthesized using the 
Drosophila Expression System™ (InVitrogen) . Stable 

15 recombinant Drosophila cell lines were obtained and the 
distribution of the protein was identified by Western 
blotting using chemiluminescent detection. The recombinant 
protein was also purified to homogeneity for N-terminal 
amino acid sequencing . 

20 Homogeneous cleavage between Vtgss and the adjoining 

mature CrFCES protein was demonstrated by N-terminal amino 
acid sequencing of VtgCrFCES. The data show that Vtgss is 
cleaved at a single point (Fig. 2) . The absence of multiple 
cleavages shows that Vtgss directs homogenous production of 

25 secreted heterologous protein. 

The Western Blot shows that practically all of 
VtgCrFCES is secreted into the culture medium (Fig. 3) . No 
intracellular VtgCrFCES was detected using chemiluminescent 
detection, which is very sensitive. The fusion of Vtgss to 

30 the LPS-binding domain resulted in its secretion whereas the 
native secretory signal of Factor C targets the LPS-binding 
domain intracellularly . The presence of VtgCrFCES in the 
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culture medium allows simple purification from either batch 
or continuous culture. 

The LPS-binding properties of the secreted protein are 
retained as shown by plasmon resonance studies, using the 
BIACORE™ in vivo biosensor apparatus (Fig. 4). This is 
despite addition of 6 Vtgss-derived amino acids to the N- 
terminus of the CrFcES protein. Thus Vtgss is capable of 
driving the secretion of correctly-folded CrFCES. Functional 
Vtgss truncated after the aspartic acid residue that is the 
carboxy side of the cleavage site (see, e.g., Figure 8A) can 
be used to reduce the number of extra amino-terminal 
residues. 

The expression vector employed here contains the basic 
features necessary for expression of recombinant proteins in 
heterologous cells; (1) an efficient promoter for 
transcription initiation; (2) enhancer elements for high 
level transcription; (3) mRNA processing signals such as 
mRNA cleavage and polyadenylation signals; (4) a polylinker 
containing multiple restriction endonucleases sites for 
insertion of foreign DNA; (5) selectable markers that can be 
used to select cells that have stably integrated the 
expression vector; and (6) plasmid backbone sequences to 
permit propagation in bacterial cells, etc. All eukaryotic 
expression systems known to date harbor several of the 
above-mentioned features (Kaufman, R. J., Methods in 
Enzymol. 185 :487-511 (1990); Kriegler. M., Methods in 
Enzymol. 185 : 512-526 (1990)). Furthermore, all eukaryotes 
use similar mechanisms for the extracellular secretion of 
proteins (Barritt, G. J., ed . 1992 in "Communication within 
Animal Cells", pp. 42-48; Lewin, B. ed., 1994 in Genes V, 
pp. 289-293) . Consequently, the rapid secretion potential 
conferred by Vtgss is not limited to a Drosophila expression 
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system. Other expression systems based on eukaryotic host 
cells (e.g., other Drosophila host cells; baculovirus 
systems; other eukaryotic host cells, including mammalian, 
avian, reptilian, piscine, invertebrate (see Examples 2-4) 
and yeast (see Example 6) ) can be used with the vectors of 
the invention. 



Example 2 : Inducible expression and secretion of the 
recombinant VtgssCAT and VtgssEGFP reporters 
from mammalian cells 

We have thus constructed a reporter CAT system (Figs. 
5A-5G) that uses the secretory signal of the O. aureus 
vitellogenin gene (Vtgss) . The excellent secretory signal 
provides quantitative secretion of the CAT reporter protein 
into the culture medium, and thus it is not necessary to 
prepare a cell lysate to assay for the reporter protein. An 
investigator is thus provided with an easy assay for CAT 
activity and so can retain methodology that he/she has 
previously established in the laboratory. This permits easy 
comparison with data previously obtained using a CAT 
reporter protein. 

The psp-VtgCAT (Figs. 5D and 5E) and ERU-psp-VtgCAT 
(Fig. 5F) vectors were constructed by typical methods. The 
psp-VtgCAT has the pSEAP-Promoter vector backbone, but the 
SEAP gene is replaced by the recombinant VtgCAT gene. The 
ERU-psp-VtgCAT is psp-VtgCAT with the estrogen response unit 
from a Xenopus vitellogenin Bl gene (kindly provided by 
Professor W. Wahli, Switzerland) inserted upstream of the 
SV40 promoter. These constructs were co-t ransf ected together 
with chicken estrogen receptor cDNA and pSEAP control DNA 
into various mammalian cell lines (e.g., CHO, NIH/3T3 & 
COS) . This allows the testing of E 2 -inducibility of the ERU- 
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psp-VtgCAT. The secreted VtgCAT reporter protein was assayed 
by CAT ELISA (Boehringer Mannheim) (Fig. 6) . To verify that 
the level of the secreted VtgCAT in the culture medium is 
directly proportional to changes in intracellular 
5 concentrations of VtgCAT mRNA, a Northern blot was performed 
under E 2 -stimulation over a time course (Fig. 7) . The results 
indicate that VtgCAT is efficiently secreted from the cells, 
and the levels of VtgCAT in the culture medium are directly 
proportional to changes in intracellular concentration of 

10 VtgCAT mRNA. Furthermore, the increase in VtgCAT can be 
detected as early as 2 h after E 2 -stimulation . Importantly, 
■ the decrease in VtgCAT transcript after 24 h is also 
reflected by a lower amount of VtgCAT protein. This 
indicates that there is no delay in VtgCAT secretion. Thus, 

15 the Vtgss signal acts to immediately secrete the protein out 
of the mammalian cells. 

pVtgEGFP, a green fluorescent protein reporter 
construct (Figs. 8A-8C) was transiently expressed in COS 
cells. Vtgss also efficiently secreted the EGFP fluorescent 

20 protein into the culture medium (Fig. 8D) . 

Example 3: Inducible expression and secretion of recombinant 
CAT reporter from other mammalian cells (NIH/3T3 
and CHO-B) 

Estrogen-inducibility of ERU-psp-VtgssCAT was also 
25 demonstrated in other mammalian cell lines. NIH/3T3 (normal 
mouse fibroblast cells) and CHO-B (Chinese hamster ovary 
cells) were transfected as described in Example 3. Three 
time points were assayed for VtgssCAT secretion into the 
culture medium, 0, 6 and 24 h post estrogen induction. Peak 
30 expression of VtgssCAT was at 24 h. NIH/3T3 and CHO-B cells 
exhibited a 4-fold and 5.5-fold increase in VtgssCAT (Fig. 
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9) . The results show that the estrogen-responsiveness, 
production and secretion of VtgssCAT is not limited to COS-1 
cells . 

Example 4 : Inducible expression and secretion of recombinant 
CAT reporter from piscine cells 

Co-transf ection of the ERU-psp-VtgCAT construct with 
chicken estrogen receptor cDNA into a piscine cell line 
(e.g., Carp Epithelial cells, EPC) also resulted in 
estrogen-inducible expression and secretion of the CAT 
reporter protein into the culture medium (Fig. 10) . EPC were 
cultured at a low temperature of 25 °C . A 3-fold increase in 
VtgCAT can be detected by 2 4 h after estrogen-induction. The 
low metabolic rate of EPC likely explains the lower 
induction of VtgCAT compared to the levels observed in 
Example 3. Nonetheless, this experiment shows the efficiency 
of production and secretion of VtgCAT conferred by Vtgss. 
Furthermore, the Vtgss-directed secretion process is 
applicable to all eukaryotic cells; it is not limited to 
mammalian cells. 

Example 5: Expression of VtgCAT reporter in S. cerevlslae 
(strain DY150) 

The VtgCAT reporter was inserted downstream of the 
strong phosphoglycerate kinase (PGK) promoter (Fig. 11) . The 
PGK promoter drives high constitutive expression of a 
heterologous protein. The plasmid provides for two 
selectable traits, URA3 and leu2-d. Expression of VtgCAT was 
monitored in yeast grown in a rich YEPD medium and synthetic 
minimal medium (MM) . From Fig. 12, it is clear that 
production and secretion of VtgCAT occurred as early as 24 
h. The amount of VtgCAT detected in the both kinds of 
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culture medium is comparable. However, VtgCAT production and 
secretion appear to decrease by 48 h, particularly, for 
yeast grown in MM. By 48 h, the pH of the media has dropped 
to below the optimum of 5-6. This affects the growth of the 
transformants, and hence, production of VtgCAT. Adjusting 
the pH back to 5 resulted in a tremendous increase in VtgCAT 
in the culture media. Thus, the growth condition is 
important for efficient secretion of VtgCAT from yeast. We 
envisage that cultivation of recombinant yeast in a 
fermenter enables better control of growth conditions and 
consequently secretion of even higher amounts of VtgCAT into 
the culture medium. It is important to note that "secretion" 
in this application includes export of recombinant 
protein (s) into the periplasmic space, as well as through it 
into the culture medium, of a recombinant host cell that has 
a cell wall. In such instances, the "cell lysate" fraction 
will include proteins localized in the periplasmic 
compartment. In this particular example, a large proportion 
of the VtgCAT is found to accumulate in the periplasm. Use 
of wall-less mutants of yeast (or other fungi) host cells 
would prevent accumulation of the heterologous proteins in 
the periplasm. Such wall-less mutants are known in the art. 

Example 6: Inducible expression and secretion of Vtgss-p- 
lactamase from bacteria 

Ampicillin binds to and inhibits a number of enzymes in 
the bacterial membrane that are involved in the synthesis of 
cell wall. The ampicillin resistance gene carried on a 
plasmid codes for an enzyme (fi-lactamase) that is secreted 
into the periplasmic space of the bacterium, where it 
catalyzes hydrolysis of the fi-lactam ring with concomitant 
detoxification of ampicillin. To demonstrate the 
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universality of Vtgss to direct secretion of recombinant 
proteins in bacteria, the native secretory signal of 15- 
lactamase was replaced with Vtgss. The expression of this 
new fusion enzyme (Vtg-Ii-lactamase) is under the control of 
an arabinose-inducible promoter (e.g., araBAD promoter). 

A ^-lactamase gene was isolated by PCR using a blactfor 
forward primer which was designed to prime the start of the 
mature protein (i.e., to exclude the native secretory 
signal, see, Figures 13, 14A and 14B) . Consequently, the 
only means of survival for pBADVtgblactKana transf ormants 
was when Vtgss correctly directs the secretion of the 
"truncated" ^-lactamase, into the periplasmic space for 
hydrolysis of ampicillin. Sufficient functional fi-lactamase 
must be secreted into the periplasm to effectively hydrolyze 
ampicillin before the latter enters the cell or the host 

cells will die. 

Expression of the ^-lactamase in the periplasm was 
assaye d by plating transf ormants on agar plates containing 
ampicillin and varying amount of arabinose. In the absence 
of arabinose (0.2% glucose) or in the presence of only 
0.0002% arabinose, no colonies are seen. This is due to the 
insufficient expression and/or secretion of Vtg-fi-lactamase 
under these conditions. When the amount of arabinose was 
increased (0.002 to -0.2% arabinose), a dose-dependent 
expression and secretion of Vtg-fl-lactamase was observed 
which is reflected by the appearance of bacterial colonies 
on the agar plates (Fig. 15) . It is envisaged that secretion 
of recombinant protein conferred by Vtgss will also apply 
for both Gram-positive and Gram-negative bacteria, since 
they share similarities in their secretory mechanisms 
(Lewin, B., ed., 1994 in "Genes V", pp. 298-299). 
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The arabinose-induced vector with Vtgss is capable of 
directing secretion of Vtg-fJ-lactamase into the periplasm as 
well as out of the bacteria into the culture medium (Figs. 
16-18), both of which caused the hydrolysis of ampicillin, 
hence the survival of positive clones carrying the Vtgss 
signal. The expression and secretion of Vtg-ft-lactamase in 
this instance is strongly influenced by the bacterial growth 
rate and rate of transcription induction. Nonetheless, the 
Vtgss is a useful system for convenient screening/reporting 
of genetically-engineered clones harboring selectable 
marker (s) /enzymes (such as U-lactamase) to be expressed as 
an indicator. In the present example, the arabinose-induced 
growth of the positive clones in the presence of ampicillin 
selection is a convenient visual screen. 

Example 7: Rate of VtgCAT and Vtg-Gal Secretion in COS-1 
Cells 

The construct, pVtg-Gal, which harbors a truncated B- 
galactosidase gene cloned downstream of Vtgss (Fig., 19) 
produced and secreted the recombinant Vtg-Gal into the 
medium when transf ected into COS-1 cells. Western blot 
analysis using mouse anti-Ii-galactosidase identified a -116 
kDa recombinant Vtg-Gal in the culture medium of transfected 
COS-1 cells (see Fig. 20). Thus, Vtgss has the potential to 
direct secretion of a large heterologous protein. 

The amount of Vtgss-fusion protein secreted at various 
time intervals was determined using a standard curve created 
using the positive control provided by the kits. The rate f 
secretion was determined by the gradient of best-fit line 
when amount of secreted protein was plotted against time 
(Fig. 21). The mean rate of VtgCAT secretion was determined 
to be 165.778 fg/ml VtgCAT G-galactosidase/min + 0.118 
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fg/mlVmin. The mean rate of Vtg-Gal secretion was 

determined to be 12.14 fg/ml Vtg-Gal/ng SEAP/min + 0.091 
fg/ml/min. In comparison SEAP was secreted at a rate of 
4.763 fg/ml SEAP/ng fi-galactosidase/min + 0.061 fg/ml/min. 
5 The rate of VtgCAT and Vtg-Gal secretion were almost 3-fold 
higher than SEAP. This would prove very critical for 
experiments investigating short term or rapid 
transcriptions. Once again, this shows that Vtgss is 
capable of directing rapid secretion of recombinant proteins 
10 regardless of their size. The values for VtgCAT and SEAP 
secreted were normalized by fl-galactosidase production, 
it 3 Similarly, the values for secreted Vtg-Gal were normalized 

with SEAP. 



The above examples are merely illustrative of the 
15 invention and are not to be understood as limiting it. 
Rather, the scope of the present invention is defined by the 
claims following. All references of the patent and 
scientific periodical literature cited above are hereby 
incorporated in their entirety by reference by such 
20 citation. 




